1 RAID: high-performance, reliable secondary storage

Peter M. Chen, Edward K. Lee, Garth A. Gibson, Randy H. Katz, David A. Patterson
June 1994 ACM Computing Surveys (CSUR), Volume 26 Issue 2

Publisher: ACM Press 

Additional Information: full citation, abstract, references, citings, index terms, review



Full text available: pdf(3.60 MB)



Disk arrays were proposed in the 1980s as a way to use parallelism between multiple 
disks to improve aggregate I/O performance. Today they appear in the product lines of 
most major computer manufacturers. This article gives a comprehensive overview of disk 
arrays and provides a framework in which to organize current and future work. First, the 
article introduces disk technology and reviews the driving forces that have popularized 
disk arrays: performance and reliability. It discusses the tw ... 



Keywords: RAID, disk array, parallel I/O, redundancy, storage, striping 
Full text available: pdf(529.53 KB)

terms 

This paper compares different indexing techniques proposed for supporting efficient 
access to temporal data. The comparison is based on a collection of important 
performance criteria, including the space consumed, update processing, and query time 
for representative queries. The comparison is based on worst-case analysis, hence no 
assumptions on data distribution or query frequencies are made. When a number of 
methods have the same asymptotic worst-case behavior, features in the methods tha ... 

Keywords: I/O performance, access methods, structures, temporal databases 
June 2005 Proceedings of the 32nd Annual International Symposium on Computer
Architecture ISCA 05 

Publisher: IEEE Computer Society 

Full text available: pdf(269.80 KB) Additional Information: full citation, abstract

The traditional approach for characterizing complex systems is to run standard workloads 
and measure the resulting performance as seen by the end user. However, unique 
opportunities exist when characterizing a system that is itself constructed from 
standardized components: one can also look inside the system itself by instrumenting 
each of the components. In this paper, we show how intra-box instrumentation can help 
one understand the behavior of a large-scale storage cluster, the EMC Centera. I ... 

4 Fortran 8X draft B 
Loren P. Meissner

December 1989 ACM SIGPLAN Fortran Forum, Volume 8 issue 4 

Publisher: ACM Press 

Full text available: pdf(21.36 MB) Additional Information: full citation, abstract, index terms

Standard Programming Language Fortran. This standard specifies the form and 
establishes the interpretation of programs expressed in the Fortran language. It consists 
of the specification of the language Fortran. No subsets are specified in this standard. The 
previous standard, commonly known as "FORTRAN 77", is entirely contained within this 
standard, known as "Fortran 8x". Therefore, any standard-conforming FORTRAN 77 
program is standard conforming under this standard. New features can b ... 
Achim Kraiss, Gerhard Weikum

August 1998 The VLDB Journal — The International Journal on Very Large Data Bases, Volume 7 Issue 3

Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(601.01 KB) Additional Information: full citation, abstract, citings, index terms

Large multimedia document archives may hold a major fraction of their data in tertiary 
storage libraries for cost reasons. This paper develops an integrated approach to the 
vertical data migration between the tertiary, secondary, and primary storage in that it 
reconciles speculative prefetching, to mask the high latency of the tertiary storage, with 
the replacement policy of the document caches at the secondary and primary storage 
level, and also considers the interaction of these policies with ... 

Keywords: Caching, Markov chains, Performance, Prefetching, Scheduling, Stochastic 
modeling, Tertiary storage 
Publisher: ACM Press

Full text available: pdf(828.46 KB)

Data sets in large applications are often too massive to fit completely inside the 
computer's internal memory. The resulting input/output communication (or I/O) between
fast internal memory and slower external memory (such as disks) can be a major 
performance bottleneck. In this article we survey the state of the art in the design and 
analysis of external memory (or EM) algorithms and data structures, where the goal is to 
exploit locality in order to reduce the I/O costs. We consider a varie ... 
Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 



Algorithms 

Eran Gal, Sivan Toledo 

June 2005 ACM Computing Surveys (CSUR), volume 37 issue 2 
Publisher: ACM Press 

Full text available: pdf(...39 KB) Additional Information: full citation, abstract, references, index terms

Flash memory is a type of electrically-erasable programmable read-only memory 
(EEPROM). Because flash memories are nonvolatile and relatively dense, they are now 
used to store files and other persistent objects in handheld computers, mobile phones, 
digital cameras, portable music players, and many other computer systems in which 
magnetic disks are inappropriate. Flash, like earlier EEPROM devices, suffers from two 
limitations. First, bits can only be cleared by erasing a large block of memory. S ... 

Keywords: EEPROM memory, Flash memory, wear leveling 



Liba Svobodova 

December 1984 ACM Computing Surveys (CSUR), Volume 16 issue 4 
Publisher: ACM Press 

Full text available: pdf(4.23 MB) Additional Information: full citation, references, citings, index terms, review



Consistent and automatic replica regeneration 
Haifeng Yu, Amin Vahdat 

February 2005 ACM Transactions on Storage (TOS), Volume 1 Issue 1
Publisher: ACM Press 

Full text available: pdf(372.24 KB) Additional Information: full citation, abstract, references, index terms

Reducing management costs and improving the availability of large-scale distributed 
systems require automatic replica regeneration, that is, creating new replicas in response 
to replica failures. A major challenge to regeneration is maintaining consistency when the 
replica group changes. Doing so is particularly difficult across the wide area where failure 
detection is complicated by network congestion and node overload. In this context, this 
article presents Om, the first read/write peer- ... 

Keywords: Peer-to-peer storage systems, availability, consistency, regeneration, 
replication 
David Lomet, Betty Salzberg
June 1989 ACM SIGMOD Record, Proceedings of the 1989 ACM SIGMOD international
We present an access method designed to provide a single integrated index structure for
a versioned timestamped database with a non-deletion policy. Historical data (superseded
versions) is stored separately from current data. Our access method is called the Time- 
Split B-tree. It is an index structure based on Malcolm Easton's Write Once B-tree. The 
Write Once B-tree was developed for data stored entirely on a Write-Once Read-Many or 
WORM optical ... 

11 Distributed

Roger S. Chin, Samuel T. Chanson

March 1991 ACM Computing Surveys (CSUR), Volume 23 Issue 1

Publisher: ACM Press 

Full text available: pdf(2.97 MB) Additional Information: full citation, abstract, references, citings, index terms, review

The development of distributed operating systems and object-based programming 
languages makes possible an environment in which programs consisting of a set of 
interacting modules, or objects, may execute concurrently on a collection of loosely 
coupled processors. An object-based programming language encourages a methodology 
for designing and creating a program as a set of autonomous components, whereas a 
distributed operating system permits a collection of workstations or personal 
computers ... 

Keywords: capability scheme, distributed operating systems, error recovery, method 
invocation, nested transaction, object model, object reliability, object-based programming 
languages, processor allocation, resource management, synchronization, transaction 
Keywords: clustered storage, data storage, fault tolerance, scalability



15 Distributed file systems: concepts and examples
Eliezer Levy, Abraham Silberschatz 

December 1990 ACM Computing Surveys (CSUR), volume 22 issue 4 
Publisher: ACM Press 

Full text available: pdf(5.33 MB) Additional Information: full citation, abstract, references, citings, index terms, review

The purpose of a distributed file system (DFS) is to allow users of physically distributed 
computers to share data and storage resources by using a common file system. A typical 
configuration for a DFS is a collection of workstations and mainframes connected by a 
local area network (LAN). A DFS is implemented as part of the operating system of each 
of the connected computers. This paper establishes a viewpoint that emphasizes the 
dispersed structure and decentralization of both data and con ... 

16 OceanStore: an architecture for global-scale persistent storage 

John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels, 
November 2000 ACM SIGARCH Computer Architecture News, ACM SIGOPS Operating
Systems ASPLOS-IX, Volume 28, 34 Issue 5, 5

Publisher: ACM Press 

Full text available: pdf(166.53 KB) Additional Information: full citation, abstract, references, citings, index terms

OceanStore is a utility infrastructure designed to span the globe and provide continuous 
access to persistent information. Since this infrastructure is comprised of untrusted 
servers, data is protected through redundancy and cryptographic techniques. To improve 
performance, data is allowed to be cached anywhere, anytime. Additionally, monitoring of 
usage patterns allows adaptation to regional outages and denial of service attacks; 
monitoring also enhances performance through pro-active movement ... 

17 Concepts and capabilities of a database computer
Jayanta Banerjee, David K. Hsiao, Richard I. Baum 

December 1978 ACM Transactions on Database Systems (TODS), Volume 3 Issue 4

Publisher: ACM Press 

Full text available: pdf(2.79 MB) Additional Information: full citation, abstract, references, citings, index terms

The concepts and capabilities of a database computer (DBC) are given in this paper. The 
proposed design overcomes many of the traditional problems of database system 
software and is one of the first to describe a complete data-secure computer capable of 
handling large databases. This paper begins by characterizing the major problems facing 
today's database system designers. These problems are intrinsically related to the nature 
of conventional hardware and can only be solved by i ... 

Keywords: clustering, content-addressable memory, database computers, keywords, 
mass memory, performance, security, structure memory 
Full text available: pdf(1.47 MB) Additional Information: full citation, abstract, references, index terms

OceanStore is a utility infrastructure designed to span the globe and provide continuous 
access to persistent information. Since this infrastructure is comprised of untrusted 
servers, data is protected through redundancy and cryptographic techniques. To improve 
performance, data is allowed to be cached anywhere, anytime. Additionally, monitoring of 
usage patterns allows adaptation to regional outages and denial of service attacks; 
monitoring also enhances performance through pro-active movement ... 

19 Petal: distributed virtual disks 

Edward K. Lee, Chandramohan A. Thekkath 
September 1996 ACM SIGPLAN Notices, ACM SIGOPS Operating Systems Review,
Proceedings of the seventh international conference on Architectural

Publisher: ACM Press

Full text available: pdf(1.10 MB) Additional Information: full citation, abstract, references, citings, index terms

The ideal storage system is globally accessible, always available, provides unlimited
performance and capacity for a large number of clients, and requires no management. 
This paper describes the design, implementation, and performance of Petal, a system that 
attempts to approximate this ideal in practice through a novel combination of features. 
Petal consists of a collection of network-connected servers that cooperatively manage a 
pool of physical disks. To a Petal client, this collection appear ... 